Statistical Properties of Share Volume Traded in Financial Markets 
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We quantitatively investigate the ideas behind the often-expressed adage 'it takes volume to move 
stock prices', and study the statistical properties of the number of shares traded QAt for a given 
stock in a fixed time interval At. We analyze transaction data for the largest 1000 stocks for the 
two-year period 1994-95, using a database that records every transaction for all securities in three 
major US stock markets. We find that the distribution P(QAt) displays a power-law decay, and 
that the time correlations in Qai display long-range persistence. Further, we investigate the relation 
between QAt and the number of transactions N&t in a time interval At, and find that the long-range 
correlations in Qai are largely due to those of NAt- Our results are consistent with the interpretation 
that the large equal-time correlation previously found between QAt and the absolute value of price 
change \GAt\ (related to volatility) are largely due to NAt- 
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The distinctive statistical properties of financial time 
series are increasingly attracting the interest of physi- 
cists In particular, several empirical studies have de- 
termined the scale-invariant behavior of both the distri- 
bution of price changes and the long-range correlations 
in the absolute values of price changes |^] . It is a common 
saying that 'it takes volume to move stock prices'. This 
adage is exemplified by the market crash of 19 October 
1987, when the Dow Jones Industrial Average dropped 
22.6% accompanied by an estimated 6 x 10 8 shares that 
changed hands on the New York Stock Exchange alone. 
Indeed, an important quantity that characterizes the dy- 
namics of price movements is the number of shares QAt 
traded (share volume) in a time interval At. Accordingly, 
in this paper we quantify the statistical properties of Qa* 
and the relation between QAt and the number of trades 
A^At in At. To this end, we select 1000 largest stocks 
from a database j| recording all transactions for all US- 
stocks, and analyze transaction data for each stock for 
the 2-year period 1994-95. 

First, we consider the time series of QAt for one 
stock, which shows large fluctuations that are strikingly 
non-Gaussian [Fig. la]. Figure lb shows, for each of 
four actively-traded stocks, the probability distributions 
P{QAt) which are consistent with a power law decay, 
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When we extend this analysis to the each of the 1000 
stocks [Fig. lc,d], we obtain an average value for the ex- 
ponent A = 1.7 ± 0.1, within the Levy stable domain 
< A < 2. 

We next analyze correlations in QAt- We consider the 
family of correlation functions {[QAt{t)] a [QAt{t + t)] q ), 
where the parameter a (< A/2) is required to ensure that 



the correlation function is well defined. Instead of an- 
alyzing the correlation function directly, we apply de- 
trended fluctuation analysis || , which has been success- 
fully used to study long-range correlations in a wide range 
of complex systems ||. We plot the detrended fluctua- 
tion function F(t) as a function of the time scale r. Ab- 
sence of long-range correlations would imply F(t) ~ r 5 , 
whereas F(t) ~ t s with 0.5 < S < 1 implies power-law 
decay of the correlation function, 
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For the parameter a = 0.5, we obtain the average value 
6 = 0.83 ± 0.02 for the 1000 stocks [Fig. 2a,b]; so from 
Eq. (g), k = 0.34 ±0.04 0. 

To investigate the reasons for the observed power-law 
tails of P{QAt) and the long-range correlations in QAt, 
we first note that 
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is the sum of the number of shares qi traded for all 
i = 1, . . . , A'a* transactions in At. Hence, we next ana- 
lyze the statistical properties of qi. Figure 3a shows that 
the distribution P(q) for the same four stocks displays a 
power-law decay P(q) ~ l/g 1+ ^. When we extend this 
analysis to each of the 1000 stocks, we obtain the average 
value C = 1-53 ± 0.07 [Fig. 3b]. 

Note that £ is within the stable Levy domain 
< C < 2, suggesting that P(q) is a positive (or 
one-sided) Levy stable distribution @||. Therefore, the 
reason why the distribution P(Qa*) has similar asymp- 
totic behavior to P{q), is that P(q) is Levy stable, and 
QAt is related to q through Eq. (||). Indeed, our estimate 
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of C is comparable within error bounds to our estimate of 
A. We also investigate if the g, are correlated in "trans- 
action time", defined by i, and we find only "weak" cor- 
relations (the analog of 6 has a value = 0.57 ± 0.04, close 
to 0.5). 

To confirm that P{q) is Levy stable, we also exam- 
ine the behavior of Q n = We first analyze 
the asymptotic behavior of P(Q n ) for increasing n. For 
a Levy stable distribution, n 1 ^ P([Q„ — (Qn)]/n 1 ^) 
should have the same functional form as P(q), where 
(Qn) = n (q) and (. . .) denotes average values. Figure 
4a shows that the distribution P(Q n ) retains its asymp- 
totic behavior for a range of n — consistent with a Levy 
stable distribution. We obtain an independent estimate 
of the exponent £ by analyzing the scaling behavior of 
the moments (i r (n) = (|Q„ — (Q n )\ r ), where r < A [ ^0| . 
For a Levy stable distribution |/t T .(n)] 1 / r ~ n 1 ^. Hence, 
we plot [/i r (n)] 1 / r as a function of n [Fig. 4b, c] and ob- 
tain an inverse slope of £ = 1.45 ±0.03 — consistent with 
our previous estimate of £ ]lT| . 

Since the qi have only weak correlations (the analog of 
S has the value = 0.57), we ask how QAt = J2f=i 1i 
can show much stronger correlations (5 — 0.83). To 
address this question, we note that (i) N& t is long- 
range correlated jL4j, and (ii) P(q) is consistent with a 
Levy stable distribution with exponent £, and therefore, 

N A { C P([Q A t-(q) N*. t ]/N$) should, from Eq. @, have 
the same distribution as any of the qi. Thus, we hypothe- 
size that the dependence of Qa* on N At can be separated 
by defining \ = [Qm - (q) NAt\/N A {^, where x is a one- 
sided Levy-distributed variable with zero mean and ex- 
ponent £ fill- To test this hypothesis, we first analyze 
P(x) and find similar asymptotic behavior to P{Q A t) 
[Fig. 4d]. Next, we analyze correlations in \ an d find 
only weak correlations [Fig. 4e,f] — implying that the 
correlations in Q At are largely due to those of N& t - 

An interesting implication is an explanation for the 
previously-observed [ p~2|Jl3| equal-time correlations be- 
tween <5a* and volatility V A t , which is the local standard 
deviation of price changes G A t ■ Now V At — W&t V^At, 
since G A t depends on A^a* through the relation G A t — 
W A t VN A t e , where e is a Gaussian-distributed variable 
with zero mean and unit variance and W At is the vari- 
ance of price changes due to all N At transactions in 
At 0. Consider the equal-time correlation, (Q At V At ), 
where the means are subtracted from Q A t and V A t ■ Since 
Q At depends on N At through Q At = (q)N At + N A { C x, 
and the equal-time correlations (N At W At ), (N At x), and 
(WAt x) are small (correlation coefficient of the order 
of w 0.1), it follows that the equal-time correlation 

(QAt VAt) oc (N^f) - {NAt) i N A ^), which is positive due 
to the Cauchy-Schwartz inequality. Therefore, (QAt VAt) 
is large because of NAt- 
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FIG. 1. (a) Number of shares traded |l5| for Exxon Cor- 
poration (upper panel) for an interval At = 15 min com- 
pared to a series of Gaussian random numbers with the 
same mean and variance (lower panel). (b) Probability 
density function P{QAt) for 4 actively-traded stocks Exxon 
Corp., General Electric Co., Coca Cola Corp., and A T 
& T Corp., shows an asymptotic power-law behavior char- 
acterized by an exponent 1 + A. Hill's method jl(| gives 
A = 1.87 ± 0.14, 2.10 ± 0.17, 1.91 ± 0.20, and 1.71 ± 0.09 re- 
spectively, (c) P(QAt) for 1000 stocks on a log-log scale. 
To choose compatible sampling time intervals At, we first 
partition the 1000 companies studied into six groups Jl4| de- 
noted I - VI, based upon the average time interval between 
trades St. For each group, we choose At > 10 St, to en- 
sure that each interval has a sufficient N^t- Thus we choose 
At = 15, 39, 65, 78, 130 and 390 min for groups I - VI respec- 
tively, each containing « 150 companies. Since the average 
value of QAt differs from one company to the other, we nor- 
malize QAt by its average. Each symbol shows the probability 
density function of normalized QAt for all companies that be- 
long to each group. Power-law regressions on the density func- 
tions of each group yield the mean value A = 1.78 ± 0.07. (d) 
Histogram of exponents Xi for i = 1, . . . , 1000 stocks obtained 
using Hill's estimator shows an approximately Gaussian 
spread around the average value A = 1.7 ± 0.1. 




(b) 1000 stocks 
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FIG. 2. (a) Detrended fluctuation function F(t) for 
(QAt) a for a = 0.5 ||, averaged for all stocks within each 
group (I- VI) as a function of the time lag r. F(t) for a time 
series is defined as the % 2 deviation of a linear fit to the in- 
tegrated time series in a box of size r JEJ. An uncorrelated 
time series displays to F(r) ~ r s , where 8 — 0.5, whereas 
long-range correlated time series display values of exponent in 
the range 0.5 < 8 < 1. In order to detect genuine long-range 
correlations, the U-shaped intraday pattern for QAt is re- 
moved by dividing each QAt by the intraday pattern || . (b) 
Histogram of <5 obtained by fitting F(t) with a power-law for 
each of the 1000 companies. We obtain a mean value of the 
exponent 0.83 ± 0.02. 
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(a) 4 Stocks 
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FIG. 3. (a) Probability density function of the num- 
ber of shares g; traded, normalized by the average value, 
for all transactions for the same four actively-traded 
stocks. We find an asymptotic power-law behavior 
characterized by an exponent f. Fits yield values 
C = 1.87±0.13, 1.61±0.08, 1.66±0.05, 1.47±0.04, respectively 
for each of the 4 stocks, (b) Histogram of the values of £ ob- 
tained for each of the 1000 stocks using Hill's estimator Jl6| ], 
whereby we find the average value £ = 1.53 ± 0.07. 
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(e) Correlations in x 
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x (a) Scaling of Q„ 
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FIG. 4. (a) Probability distribution of Q n as a function of 
increasing n = 1, . . . , 256 apparently retains the same asymp- 
totic behavior, (b) Scaling of the r th moments /i r with in- 
creasing n for the same four stocks. The inverse slope of this 
line yields an independent estimate of the exponent f. We 
obtain £ = 1.43 ± 0.02, 1.35 ± 0.03, 1.42 ± 0.01, 1.41 ± 0.02 
respectively, (c) Histogram of exponents ( obtained by fit- 
ting a power-law to the equivalent of part (b) for all 1000 
stocks studied. We thus obtain a value £ = 1.45 ± 0.03 con- 
sistent with our previous estimate using Hill's estimator, (d) 
Histogram of slopes estimated using Hill's estimator for the 
scaled variable % = [Qa* — (q)NAt]/N^f compared to that of 
Qa*. We obtain a mean value 1.7 ± 0.1 for the tail exponent 
of X) consistent with our estimate of the tail exponent A for 
Qa*. (e) Detrended fluctuation function F(t) for %i where 
each symbol denotes an average of F(t) for all stocks within 
each group (I- VI as in Fig. 1). (f) Histogram of detrended 
fluctuation exponents for \- We obtain an average value for 
the exponent 0.61 ± 0.03 which indicates only weak correla- 
tions compared to the value of the exponent 8 — 0.83 ± 0.03 
for QAt- 
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